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WHILE MOVING A FIXED LENGTH WINDOW ALONG A 
CLUSTER CONSENSUS SEQUNCE, SEARCH AN INPUT 
SEQUENCE HAVING AN EXACT MATCHING WITH A LENGTH 
OF S USING A FIXED LENGTH PARTIAL SEQUENCE TABLE 



COMPARE THE CLUSTER CONSENSUS SEQUENCE WITH THE 
INPUT SEQUENCE HAVING AN EXACT MATCHING WHOSE 
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SEQUENCE A: 

PARTIAL SEQUENCE Al PARTIAL SEQUENCE A2 



SEQUENCE B: 101 

PARTIAL SEQUENCE B 1 PARTIAL SEQUENCE B2 
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PARTIAL 
SEQUENCE 



INPUT 
SEQUENCE 



Al 



A2 



Bl 



B2 



CI 



C2 



103 



401 



400 



500 



450 



4/16 



YASUDA et al. 



FIG.5 



P 

a 
m 

W 
m 



\ 

N 
G 

n 



105 

Vn 






5( 


)1 




104 














502 


M:;:::|::;;;!:!:i:l:!:l: 






5 


< 7 > 



5/16 



YASUDA etal. 



FIG.6 



CLUSTER 1 



CLUSTER 2 



CLUSTER 1 



UNPROCESSED 



CLUSTER 2 



CLUSTER 2 



UNPROCESSED 



UNPROCESSED 



CLUSTER 2 



UNPROCESSED 



CLUSTER 1 



UNPROCESSED 



UNPROCESSED 




CONTINUE PROCESSING 
WHILE UNPROCESEED 
INPUT SEQUENCES REMAIN 



6/16 



FIG.7 
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SEQUENCE A: 
Al A2 

p a eh — ^ a mm 



SEQUENCE B: 

Bl B2 B3 B4 B5 B6 

E3 g m nam 



SEQUENCE C: 

CI C2 C3 C4 C5 C6 

in Em ei ra a 
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- SELECTION OF FIXED LENGTH PARTIAL SEQUENCE POSITION 

THE NUMBER OF FIXED LENGTH PARTIAL SEQUENCES (K): _6^^ 10 ° 2 
UPPER LIMIT OF DISTANCE FROM END OF SEQUENCE (R): _63_/" /10 ° 3 
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SETTING OF FIXED LENGTH PARTIAL SEQUENCE LENGTH " 

UPPER LIMIT OF EXPECTED VALUE OF THE NUMBER OF / 1008 
COINCIDENTAL MATCHING OCCURRENCE (c): 0.125 

FIXED LENGTH PARTIAL SEQUENCE LENGTH (s): 13 — 1__ 1009 
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— TOTAL PROCESSING STATUS 

THE NUMBER OF PROCESSED 
SEQUENCES: 654,321 y_ 1 102 0 % 
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[— SEQUENCE COMPARING STATUS 

SEQUENCE WITHIN AN EXACT MATICHING 
AREA HAVING A LENGTH OF S: ATGGC 

OVERLAPPING LENGTH: 123 ^\ 
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— TOTAL PROCESSING RESULT 

THE NUMBER OF INPUT 
SEQUENCE: 654,321 

AVERAGE CLUSTER SIZE: 1 1.5 



THE NUMBER OF PRODUCED 
CLUSTER: 56,789 
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SEQUENCE ASSEMBLING STATUS 

OVERLAPPING LENGTH: 123 1209 

SEQUENCE WITHIN AN EXACT MATCHING AREA - 
HAVING A LENGTH OF S: ATGGC 



.1210 



— SELECTION OF CLUSTER TO BE DISPLAYED 

INPUT SEQUENCE: ^/ l2U | DISPLAY \ ^ 1212 

CLUSTER NUMBER: 



r 



| DISPLAY 1214 



12/16 



YASUDA et al. 



FIG. 13 



1301 




13/16 



YASUDA et al. 



FIG. 14 



1401 



1 



G 



c/ 



FIG. 15 



G 



c/ 



V 

1501 



14/16 



FIG. 16 



A 


T 


G 


C 



FIG. 17 

1701 



A 


T 




G 


C 




A 


T 


A 


G 


C 



15/16 



YASUDA et al. 



FIG. 18 




-103 



FIXED LENGTH PARTIAL SEQUENCE TABLE 



PROGRAM FOR SORTING SEQUENCES 



PROGRAM FOR CONSTRUCTING THE FIXED 
LENGTH PARTIAL SEQUENCE TABLE 
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PROGRAM FOR SEARCHING FOR AN INPUT 
SEQUENCE WHICH HAS A PARTIAL 
SEQUENCE MATCHING WITH A 
CONSENSUS SEQUNCE 
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PROGRAM FOR RECONSTRUCTING THE 
CONSENSUS SEQUENCE 
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